Ginkgo parallel test node CI timeout debugging #3792
Codecov Report
@@ Coverage Diff @@
## master #3792 +/- ##
=======================================
Coverage 44.29% 44.29%
=======================================
Files 139 139
Lines 13400 13400
=======================================
Hits 5936 5936
Misses 6882 6882
Partials 582 582
/test all
Force-pushed from c1a96a4 to 85c4e7a
The last successful test run was on
The ginkgo --debug flag could help a lot; however, there is an ongoing ginkgo PR, onsi/ginkgo#461, to get proper debug output for parallel test nodes that get stuck or fail due to a wrong signal. So it is hard to find the reason for the failure with
Force-pushed from 85c4e7a to 2ddf0a6
@amitkrout can we track this in a separate issue? It seems this needs a good amount of time to investigate. However, I suspect it is happening with devfile watch. One more hit: https://prow.ci.openshift.org/view/gs/origin-ci-test/pr-logs/pull/openshift_odo/3763/pull-ci-openshift-odo-master-v4.5-integration-e2e/1296528880043036672#1:build-log.txt%3A682
On Travis also:
Is it always failing on the same test spec?
I was not able to reproduce it even once in 20 tries locally. It seems to be specific to CI.
/test all
It looks like it started with PR #3733. Could something in that PR cause this?
Yes, I am able to reproduce it locally with test-cmd-devfile-log after 3 tries.
ping @adisky
Cause identified; it will be resolved via #3808.
@amitkrout: Closed this PR.
What type of PR is this?
/kind failing-test
What does this PR do / why we need it:
Recently we have been hitting the "Ginkgo timed out waiting for all parallel nodes to report back!" error in most of our periodic jobs, and in PR jobs too, in the devfile integration tests. This may be caused by bad signal handling between ginkgo and the spec, or a test may be getting stuck partway through. So this PR adds the -debug flag to get more information on the failing node.
As far as the test timeout is concerned, the current timeout, TIMEOUT ?= 7200s, is enough for a devfile test run. If the CI debug log shows that 2 hours is not enough, we will increase it.
NOTE: I have tried locally and have not been able to reproduce it even once.
Which issue(s) this PR fixes:
Fixes NA
PR acceptance criteria:
Unit test : NA
Integration test : NA
Documentation : NA
How to test changes / Special notes to the reviewer:
The debug flag should report correct information on the failing node.